Using Message Semantics to Reduce Rollback in Optimistic Message Logging Recovery Schemes
نویسندگان
چکیده
Recovery from failures can be achieved through asyn-chronous checkpointing and optimistic message logging. These schemes have low overheads during failure-free operations. Central to these protocols is the determination of a maximal consistent global state, which is recoverable. Message semantics is not exploited in most existing recovery protocols to determine the recoverable state. We propose to identify messages that are not innuential in a computation through message semantics. These messages can be logically removed from the computation without changing its meaning or result. We show that considering these messages in the recoverable state computation gives rise to re-coverable states that dominate the recoverable state deened under conventional model. We then develop an algorithm for identifying these messages. This technique can also be applied to ensure a more timely commitment for output in a distributed computation.
منابع مشابه
Using Message Semantics for Fast-Output Commit in Checkpointing-and-Rollback Recovery
Checkpointing is a very effective technique to ensure the continuity of long-running applications in the occurrence of failures. However, one of the handicaps of coordinated checkpointing is the high latency for committing output from the application to the external world. Enhancing the checkpointing scheme with a message logging protocol is a good solution to reduce the output latency. The ide...
متن کاملManetho: Transparent Rollback-Recovery with Low Overhead, Limited Rollback, and Fast Output Commit
Manetho is a new transparent rollback recovery protocol for long running distributed computations It uses a novel combination of antecedence graph maintenance unco ordinated checkpointing and sender based message logging Manetho simultaneously achieves the advantages of pessimistic message logging namely limited rollback and fast output commit and the advantage of optimistic message logging nam...
متن کاملEfficient Transparent Optimistic Rollback Recovery for Distributed Application Programs
Existing rollback-recovery methods using consistent checkpointing may cause high overhead for applications that frequently send output to the “outside world,” since a new consistent checkpoint must be written before the output can be committed, whereas existing methods using optimistic message logging may cause large delays in committing output, since processes may buffer received messages arbi...
متن کاملA Fast Rollback-Recovery Scheme based on Optimistic Message Logging
This paper presents an eecient rollback recovery scheme based on the optimistic message logging. To speed up the recovery process, the rollback point of the failed process is broadcast and other processes asynchronously make the rollback decision based on the vector time. Asynchronous recovery process usually causes two possible problems: One is the message delivered from an invalid state inter...
متن کاملMinimizing Timestamp Size for Completely Asynchronous Optimistic Recovery with Minimal Rollback
Basing rollback recovery on optimistic message logging and replay avoids the need for synchronization between processes during failure-free execution. Some previous research has also attempted to reduce the need for synchronization during recovery, but these protocols have suffered from three problems: not eliminating all synchronization during recovery, not minimizing rollback, or providing th...
متن کامل